Abstract

The Foreground-Background (FB) discipline, which gives service to the customer that has 
received the least amount of service, minimises the queue length for a certain class of heavy-tailed 
service times. In this paper we give an overview of the results in the literature on single-server 
queues with the FB discipline. 
1 Introduction

Queues with heavy-tailed distributions are of growing importance, as they seem to be good models
for internet traffic, see for example Crovella and Bestavros [S] and Taqqu et al. [35]. For such 
queues, First-in-first-out (FIFO) and other standard disciplines do not perform well, and instead 
one should consider service disciplines that are able to react properly to the arrival of very large
jobs. The Shortest-Remaining-Processing-Time discipline (SRPT) provides an improvement, but 
it uses the exact size of a job, and this knowledge is not always available. Among the disciplines 
that do not use knowledge of the job size, the Foreground-Background (FB) discipline is optimal 
in the following sense: for a certain class of heavy-tailed distributions, FB stochastically minimises 
the queue length, see Theorem 3.1 below. In this paper we give an overview of the literature on
queues with the FB discipline. 

The FB discipline works according to the following priority rule: priority is given to the job that 
has received the least amount of service. If there are $n$ such jobs, for some $n \in \mathbb{N}$, then they are
served simultaneously, i.e., each of them is served at rate $1/n$. Alternatively formulated, if the age
of a job is the amount of work it has received, then a server using the FB discipline always serves
the youngest job(s).
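The priority rule can be made concrete with a small event-driven sketch. The function below is illustrative only and not taken from the literature surveyed here: the name `fb_departures`, the tolerance `EPS` and the input format (a list of arrival time and size pairs) are assumptions. It serves the youngest jobs at equal rates and advances time to the next arrival, completion, or moment at which the youngest jobs catch up in age with an older job.

```python
def fb_departures(jobs):
    """Departure times under FB for a finite list of (arrival_time, size) jobs.

    Hypothetical helper: a single server of rate 1 always serves the jobs
    with the least attained service (age), sharing its capacity equally.
    """
    EPS = 1e-12                      # tolerance for comparing ages and times
    order = sorted(range(len(jobs)), key=lambda i: jobs[i][0])
    age = {}                         # job index -> attained service so far
    done = [None] * len(jobs)
    t = 0.0
    k = 0                            # position of the next arrival in `order`
    while k < len(order) or age:
        if not age:                  # idle server: jump to the next arrival
            t = max(t, jobs[order[k]][0])
        while k < len(order) and jobs[order[k]][0] <= t + EPS:
            age[order[k]] = 0.0      # a new job is strictly the youngest
            k += 1
        youngest = min(age.values())
        group = [i for i, a in age.items() if a - youngest <= EPS]
        n = len(group)               # each job in the group is served at rate 1/n
        # Service per job in the group until the next event of each type.
        to_finish = min(jobs[i][1] - age[i] for i in group)
        older = [a for a in age.values() if a - youngest > EPS]
        to_merge = min(older) - youngest if older else float("inf")
        to_arrival = (jobs[order[k]][0] - t) / n if k < len(order) else float("inf")
        ds = min(to_finish, to_merge, to_arrival)
        t += ds * n                  # the group needs n * ds units of server time
        for i in group:
            age[i] += ds
            if jobs[i][1] - age[i] <= EPS:
                done[i] = t
                del age[i]
    return done


# A size-1 job arriving at time 1 overtakes a size-3 job that started at time 0.
print(fb_departures([(0.0, 3.0), (1.0, 1.0)]))
# With equal (deterministic) sizes, both jobs leave together.
print(fb_departures([(0.0, 2.0), (1.0, 2.0)]))
```

The sketch treats sizes and arrival times as floats and assumes strictly positive sizes; it is meant for small examples, not for performance studies.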

It is perhaps surprising that until recently the FB discipline has received little attention in the 
literature: almost all results on the FB queue date from after 1980. This lack of attention may 
have to do with the only relatively recent interest in queues with heavy-tailed characteristics, as 
well as with the difficulties that arise in the analysis of the FB queue. The first important work
on FB queues was done by Schrage [37] and Kleinrock [18] in the 1960s and 1970s. They mainly
studied the sojourn time of a job of size x, and derived its mean and Laplace transform. Around 
1980, Pechinkin [28], Schassberger [HE] and Yashkov obtained expressions for the generating
functional of the steady-state queue length. Yashkov j^J provided a survey of most results known 
at the time. At the beginning of the 1990's, Righter and Shanthikumar |32l |B8 proved optimality 
results for the transient queue length. Since 2000, there has been a host of new results, for example, 
on the distribution of the sojourn time (Borst et al. [H] and Nuyens et al. US]), the slowdown
(Harchol-Balter and Wierman [16], Rai et al. [SB]) and the maximum queue length (Nuyens [25]).

In this paper we intend to give an overview of the (different types of) theoretical results obtained 
for the single-server FB queue. A survey of this type does not exist in the literature. The survey
articles jUJHSl by Yashkov on processor-sharing queues include results for FB queues, but they date
back to 1987 and 1992. Since then, both the interest in FB queues and the number of results have
strongly increased. As a consequence, most results in the present survey are not contained in the 
older surveys. In addition to existing results, this paper contains some new material. Furthermore, 
to indicate similarities or differences, we shall compare the results for FB with those for
Processor-sharing (PS), SRPT and FIFO. We have chosen to focus on theoretical results for the M/G/1 and
GI/GI/1 queue. Numerical results like those in Rai et al. [31] are not discussed, nor are the
studies of multi-level queues with an FB mechanism in Aalto et al. [1]. For readers interested in
implementing FB, we refer to Rai et al. [29, 30].

This survey is organised as follows. Section 2 is devoted to the history of the FB model and the
acronym FB, and a discussion of general notions and intuition for the FB queue. In Section 3 we
describe the optimality results for the FB queue. The mean performance is considered in Section 4.
Section 5 describes results on the stationary queue length and the maximal queue length. Sojourn
time asymptotics are the subject of Section 6. The survey is concluded with Section 7, which is
devoted to the slowdown.

2 History of the FB model and used acronyms, intuition and general notions

Initially, in the second half of the 1960s, the term FB, or rather FB$_n$, was used as an abbreviation
for both Foreground-Background and Feedback queueing systems. These different names referred to
the same model, see Schrage [37], Coffman and Kleinrock [Hj, and the survey article on time-sharing
models by McKinney [21]. The FB$_n$ queue with so-called quantum size $q$ is a one-server queue with
$n$ states, or priority classes. This queue operates as follows.

Upon arrival in the queue, a job enters the first (or highest priority) state. Within each priority 
state, the priority of jobs depends on their arrival time to that state, in a FIFO manner. Jobs 
are served one at a time and uninterruptedly for a time period of length q. After the server has 
completed a job's service request in a certain state, a job from the highest (non-empty) priority
state is selected for service. If a job does not leave the queue during its time in the kth state, it 
moves to state k + 1 (which has lower priority) and waits until it is served in that state. In the nth 
and final state, jobs are served only if there are no jobs in other states. In that final state, they are 
served until they leave the system. So, for example, FB$_1$ = FIFO.

The interest in the FB$_n$ model with $n$ states and positive quantum size $q$ has faded a bit.
Instead, people have studied the limiting case where (first) $n \to \infty$ and (then) $q \to 0$. After
Kleinrock devoted a section of [18] to this limiting case of the FB$_n$ model, the term
Foreground-Background (FB) is generally used for this model, and so it is in this paper. We believe the term
Foreground-Background is preferable over Feedback, since there is no real feedback in the limit
$q \to 0$.

To distinguish FB from the FB$_n$ model, some authors prefer to use the term Foreground-Background
Processor Sharing (FBPS), FB$_\infty$ or generalised Foreground-Background. Others have
invented their own acronyms like LAS (Least Attained Service first) jHJ, LAST (Least Attained
Service Time first) [32, 34], or SET (Shortest Elapsed Time) [Z|. Furthermore, FB may be disguised
as 'advantageous sharing of a processor' [28] as well. Due to this cacophony of names, some
results on FB queues are difficult to find in the literature. One of the goals of the present survey
is to unite the FB world again, and to provide a clear overview of all available results. This should
prevent theorems from being re-proved, sometimes even in a weaker form, like Theorem 2.1 in Feng
and Misra [T5].

2.1 Intuition 

To get a feeling for the evolution of the queue under the FB discipline, let us consider what happens
when a new job arrives to the FB queue. Since that job is (strictly) the youngest in the queue, it is
served immediately. Now there are three scenarios.

1. The new job needs at least as much service as the age of the job(s) that was (were) preempted
at its arrival. Then after some time the job joins a cohort, a group of jobs with the same age.

2. The job needs less service than the age of the job that was preempted. Then the new job leaves
the queue before joining the older cohort, and the server returns to the cohort that was preempted.

3. Before joining another cohort or leaving the queue, the new job is preempted itself by the
arrival of another new job.

Since a job with service time $x$ is younger than $x$ throughout its stay in the queue, it has
priority over all jobs older than $x$. As a consequence, the time such a job spends in the system is
the same as if all service times were truncated at $x$, i.e., if all service times $y$ had the value
$\min\{y, x\}$ instead. Hence, in the FB queue, small jobs do not suffer from the presence of large jobs
in their midst. The sojourn times of small jobs are therefore insensitive to the shape of the tail of the
service-time distribution. This property turns out to be very useful when studying sojourn times
in the FB queue. It may be shown that the ratio of the expected time a job spends in the system
and its service time converges to 1 as the service time converges to zero, see (4) below.

The consequence of this quick service to younger jobs is felt by jobs with (very) large demands.
These jobs are mostly served when no other jobs are present, see also the discussion below equality
(5). One of the main issues when studying FB queues is to determine the price that large jobs have
to pay for the priority that FB gives to short jobs.

2.2 Set-up 

Unless stated otherwise, we consider an M/G/1 queue with arrival rate $\lambda$, service-time distribution
$F$, and generic service time $B$. Assume that the load $\rho$ satisfies $\rho = \lambda EB < 1$. If $F$ has a density,
we denote it by $f$. Let $Q$ denote the stationary queue length, measured in number of jobs. The
busy period length is denoted by $L$. An important role is played by queues with truncated service
times $B \wedge x = \min\{B, x\}$. The load $\rho(x)$ of such a queue is given by $\rho(x) = \lambda E(B \wedge x)$. The class
$\mathcal{D}$ is the class of all service disciplines that do not use knowledge of the residual service times. So,
for example, FB, PS, FIFO $\in \mathcal{D}$, but SRPT $\notin \mathcal{D}$.

Finally, some results in this paper are stated in terms of certain classes of service-time
distributions, like DFR, and certain stochastic orderings. Readers unfamiliar with these notions are
referred to the appendix.

3 Optimality of FB 

In this section we describe two optimality results for the transient queue length. Originally, they 
were proved in the classical GI/GI/1 setting. However, they hold true in the following more general 
set-up as well. 

Consider queues with the same arrival and service times, but with different service disciplines.
Let $Q_\pi(t)$ denote the queue length at time $t$ in the queue with discipline $\pi$. The sequence of arrival
times may be any sequence, even deterministic. Furthermore, at time 0 some jobs could be present,
as long as the number of jobs present at time 0 in the two queues is the same and their ages at time
0 are (pairwise) equal. Finally, jobs may have received service prior to their arrival in the queue,
i.e., they may enter the queue with a positive age.

The first optimality result, proved in Corollaries 2.1.2 and 2.3.2 in Righter and Shanthikumar
[3*2], is in terms of the marginal distributions of the process $\{Q(t), t > 0\}$. It generalises the result
mentioned in Yashkov that for DFR distributions, FB minimises $EQ$ over all disciplines in $\mathcal{D}$.

Theorem 3.1 Consider a GI/GI/1 queue. Let $\pi \in \mathcal{D}$, and let $\sigma \in \mathcal{D}$ be a non-preemptive
discipline. If the service-time distribution belongs to the class DFR, then for every $t > 0$,

$$Q_{FB}(t) \le_{st} Q_\pi(t) \le_{st} Q_\sigma(t).$$

For IFR service times the inequalities are reversed.
Service-time distributions are often divided into heavy-tailed and light-tailed distributions, and so
it is in this paper, for example in Section 6 below. Theorem 3.1 indicates that for the FB queue an
alternative approach could be used, namely by considering DFR and IFR distributions, although
these two classes do not contain all possible distributions. As remarked in Appendix A below, there
is a connection between these two divisions: some well-known heavy-tailed distributions are DFR,
while some well-known light-tailed distributions are IFR.

Using a stronger condition on the density of the service-time distribution, an optimality result
for the law of the process $\{Q(t), t > 0\}$ can be obtained. Theorem 13.D.8 by Righter in [32] reads:

Theorem 3.2 Consider a GI/GI/1 queue. Let $\pi \in \mathcal{D}$, and let $\sigma \in \mathcal{D}$ be a non-preemptive
discipline. If the service-time distribution has a log-convex density, then

$$\{Q_{FB}(t), t > 0\} \le_{st} \{Q_\pi(t), t > 0\} \le_{st} \{Q_\sigma(t), t > 0\}. \tag{1}$$

For service times with a log-concave density, the inequalities are reversed.

Let us conclude with a few remarks on the proofs of Theorems 3.1 and 3.2. Theorem 3.2
essentially already appeared in Righter and Shanthikumar, but there it was formulated only
in terms of the FIFO discipline. The proofs of Theorems 3.1 and 3.2 are given in discrete time
only. The extension to continuous time is said to follow from a limit argument. However, the limit
argument given in chapter 10 of [21] suggests that such an argument is far from trivial.

A third optimality result, Theorem 3.14 in Righter et al. [31], states that if $E[B - x \mid B > x]$ is
increasing in $x$, then $EQ_{FB}(t) \le EQ_\pi(t)$ for all $t > 0$ and $\pi \in \mathcal{D}$. However, the proof contains
an error that cannot be immediately fixed, as was noted by Aalto et al. ^: the result is proved
by considering the unfinished work of jobs with age less than $x$, but the proof does not take into
account that this quantity makes a vertical downward jump when a job reaches age $x$. The proof
of Lemma 2.4 in Feng and Misra [13] contains the same mistake.

Finally, the FB discipline is in a certain sense opposite to non-preemptive disciplines (i.e.,
disciplines that do not interrupt the service of a customer once it has started) such as FIFO and
LIFO: FB serves the youngest jobs, while non-preemptive disciplines give priority to the oldest job. 
This idea is illustrated by the optimality theorems. 

4 Mean performance 

In this section we give several results that describe the mean performance of the FB queue. We 
consider the mean queue length in the stationary queue, its heavy-traffic behaviour, the influence 
of variability in the service-time distribution, and the conditional expected sojourn time. 

Many of the results described in this survey illustrate the idea that for heavy-tailed distributions 
the FB discipline performs very efficiently. Many results indicate that for those distributions the 
FB discipline is markedly superior to FIFO. Furthermore, we shall encounter the following perhaps
counter-intuitive phenomenon: for heavy-tailed service times, the FB queue may behave better than
for light-tailed service times (see e.g. Corollary 4.8 below).



4.1 Mean queue length 

The mean queue length in the M/G/1 FB queue is given by the following expression, see Pechinkin
[28], although there the factor $\lambda$ in front of the integral is missing:

Corollary 4.1 In the M/G/1 FB queue,

(2)

The same expression can be found by using Little's law in combination with Theorem 4.9 below.
Using that $EB < \infty$, it may be seen from (2) that $EQ < \infty$. In fact, Yashkov [13] gives the
following inequality for the expected queue length:

$$EQ \le \frac{\rho(2-\rho)}{2(1-\rho)^2}. \tag{3}$$

In (3), equality holds if and only if the service times are deterministic. It is not surprising that
deterministic service times maximise the mean queue length: remember that in the M/D/1 FB
queue, all jobs leave at the end of the busy period. For light-tailed service times, the behaviour of
the queue under the FB discipline may be quite poor. Consider, for example, a busy period in the
G/D/1 queue. By the FB priority rule, none of the jobs is allowed to leave the queue before any
other job. Hence, all jobs that arrive during a busy period leave the queue together at the end of
the busy period. Kleinrock [18] uses this example to emphasise the disastrous effect that using the
FB discipline may have.

Next we consider higher order moments of the stationary queue length. In the M/G/1 PS queue,
all moments of the stationary queue length $Q_{PS}$ exist, since $P(Q_{PS} = n) = (1-\rho)\rho^n$ for $n \ge 0$.
Furthermore, by using the Pollaczek-Khinchin transform for the queue length, it may be seen that
in the M/G/1 FIFO queue, $EQ^n_{FIFO} < \infty$ if and only if $EB^{n+1} < \infty$. For higher order moments in
the FB queue, Theorem 9.1 in [21] gives the following relation between the moments of $B$ and $Q$.

Theorem 4.2 In the M/G/1 FB queue, if $EB^\alpha < \infty$ for some $\alpha > 1$, then all moments of $Q$ are
finite.

It is an open question whether $EB < \infty$ is enough to show that all moments of $Q$ exist.
4.2 Heavy traffic

In this subsection we describe results on the heavy-traffic behaviour of the mean queue length in the
stationary queue. Most of these results have the following form: for a certain class of service-time
distributions, there is a constant $\gamma > 0$ such that $EQ = O((1-\rho)^{-\gamma})$ as $\rho \uparrow 1$.

Let $x_F$ be the right end-point of the service-time distribution $F$, i.e., $x_F = \sup\{x : F(x) < 1\}$.
It turns out that the mean queue length shows different heavy-traffic behaviour for $x_F = \infty$ and
$x_F < \infty$. We start with the case that $x_F = \infty$. Recall that a function $\eta$ is called regularly varying
at $\infty$ of index $\alpha$ if $\lim_{x\to\infty} \eta(tx)/\eta(x) = t^\alpha$ for all $t > 0$.

Theorem 4.3 Let $x_F = \infty$. Then $(1-\rho)EQ = O(1)$ for $\rho \uparrow 1$ if one of the following conditions
holds:

1. $1 - F$ is regularly varying at $\infty$ of index $\alpha < -1$,

2. $1 - F(x) = o(\exp(-c x^\beta))$ for some $\beta > 0$ as $x \to \infty$.

This theorem was proved in [23]. By comparing the FB queue with the PS queue and using
Theorem 3.1, it follows that $(1-\rho)EQ = O(1)$ holds for DFR distributions as well.

Bansal and Gamarnik [3] obtained the following stronger results for Pareto distributions. 

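Theorem 4.4 Let $1 - F(x) = (k/x)^\alpha$, $x \ge k$, for some $\alpha > 1$. Then for $\rho \uparrow 1$,

$$EQ = \begin{cases}
O\left(\log\frac{1}{1-\rho}\right) & \text{if } \alpha < 2, \\
O\left(\log^2\frac{1}{1-\rho}\right) & \text{if } \alpha = 2, \\
O\left((1-\rho)^{-\frac{\alpha-2}{\alpha-1}}\right) & \text{if } \alpha > 2.
\end{cases}$$

By partial integration of (2), Yashkov [12] showed that $EQ \ge -\log(1-\rho)$ for $\rho < 1$. Combining
this with Theorem 4.4 we find the following new result.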

Corollary 4.5 Let $1 - F(x) = (k/x)^\alpha$, $x \ge k$, for some $1 < \alpha < 2$. Then there exist $c_2 \ge c_1 \ge 1$
such that

$$(c_1 + o(1)) \log\frac{1}{1-\rho} \le EQ \le (c_2 + o(1)) \log\frac{1}{1-\rho}, \qquad \rho \uparrow 1.$$

In addition, a few other heavy-traffic results exist for the FB queue. The survey paper [13] quotes
the following heavy-traffic limits from articles that appeared in Russian journals. Unfortunately,
these could not be retrieved by the author of this survey. The following theorem is by Nagorenko
and Pechinkin [22]. Recall that $f(x) \sim g(x)$ for $x \to \infty$ means that $\lim_{x\to\infty} f(x)/g(x) = 1$.

Theorem 4.6 For service-time distributions with tail $1 - F(x) \sim a x^b e^{-cx}$, for some $a > 0$, $b > 0$
and $c > 0$, the stationary queue length $Q$ in the M/G/1 FB queue satisfies

$$\lim_{\rho \uparrow 1} P(Q/EQ < x) = 1 - e^{-x}, \qquad x > 0.$$

For the M/D/1 FB queue an expression for the Laplace transform of $\lim_{\rho \uparrow 1} Q/EQ$ is given by
Yashkov and Yashkova [44].



We now turn to the case that $x_F < \infty$. It turns out that the heavy-traffic behaviour of the
mean queue length is different from that in the case $x_F = \infty$.

Theorem 4.7 Let $x_F < \infty$. If $1 - F(x) \sim (x_F - x)^\beta \eta$ for some constants $\beta, \eta > 0$ as $x \to x_F$,
then for $\rho \uparrow 1$,

$$(1-\rho)^{(\beta+2)/(\beta+1)} EQ = O(1).$$

Furthermore, for the M/D/1 queue we have the following result. By combining the
Pollaczek-Khinchin mean value formula and (3), it can be seen that the queue lengths $Q_{FIFO}$ and $Q_{FB}$ in the
M/D/1 queue under FIFO and FB satisfy:

$$\frac{EQ_{FB}}{EQ_{FIFO}} = \frac{\rho(2-\rho)}{2(1-\rho)^2} \cdot \frac{2(1-\rho)}{\rho(2-\rho)} = \frac{1}{1-\rho}.$$

4.3 Variability of the service-time distribution 

In this section we consider the effect on the queue length of more variability in the service times. 
For heavy-tailed service times the FB discipline performs quite well. One may therefore wonder
whether in the FB queue more variability in the service times could be beneficial to the behaviour 
of the queue; could it, for example, lead to a smaller mean queue length? In the literature, a few 
conjectures of this type exist. The survey paper Yashkov [IH] claims that in the stationary FB 
queue "EV decreases with an increase in the dispersion of F(x), and conversely increases as the 
dispersion of F(x) decreases." 

Furthermore, for a random variable $X$ with $EX > 0$, the coefficient of variation is defined
as $C(X) = \sqrt{\mathrm{Var}(X)}/EX$. Coffman and Denning [2] conjectured that an FB queue with generic
service time $B$ and $C(B) > 1$ has a smaller expected sojourn time than a queue with generic service
time $B'$ with the same mean and $C(B') < 1$. Wierman et al. jlU] invalidated this conjecture.

However, a result somewhat similar to the conjecture does hold. First note that by inequality (3),
for fixed $\rho$, the mean queue length in the FB queue is maximal for deterministic service times.
These have coefficient of variation zero. By using that in the M/M/1 queue all disciplines have the
same queue-length distribution, we obtain the following corollary to Theorem 3.1.

Corollary 4.8 Consider four stationary M/G/1 FB queues with arrival rate $\lambda$ and load $\rho$. The
service-time distributions are DFR, exponential, IFR, and deterministic, respectively. Let $Q^{DFR}$,
$Q^{exp}$, $Q^{IFR}$ and $Q^{det}$ denote the stationary queue lengths. Then

$$EQ^{DFR} \le EQ^{exp} = \frac{\rho}{1-\rho} \le EQ^{IFR} \le EQ^{det} = \frac{(2-\rho)\rho}{2(1-\rho)^2}.$$

It can be shown that if $X \in$ DFR, $Y \in$ IFR, $Z$ is deterministic, and $EX = EY = EZ$, then
$C(X) \ge C(Y) \ge C(Z)$. Hence, for special classes of service-time distributions, the queue with the
larger coefficient of variation does have the smaller mean queue length.
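As a quick numerical illustration of the two explicit values in Corollary 4.8, the sketch below evaluates $\rho/(1-\rho)$ (exponential service) and $\rho(2-\rho)/(2(1-\rho)^2)$ (deterministic service under FB). For comparison it also evaluates the standard Pollaczek-Khinchin mean for the M/D/1 FIFO queue, $\rho(2-\rho)/(2(1-\rho))$. The function name `mean_queue_lengths` is an assumption made for this example.

```python
def mean_queue_lengths(rho):
    """Mean stationary queue lengths for load rho in (0, 1).

    Returns (EQ for exponential service, EQ for M/D/1 FB, EQ for M/D/1 FIFO).
    Illustrative helper; the formulas are the closed forms quoted in the text
    plus the Pollaczek-Khinchin mean for deterministic service.
    """
    if not 0.0 < rho < 1.0:
        raise ValueError("rho must lie strictly between 0 and 1")
    eq_exp = rho / (1.0 - rho)
    eq_det_fb = rho * (2.0 - rho) / (2.0 * (1.0 - rho) ** 2)
    eq_det_fifo = rho * (2.0 - rho) / (2.0 * (1.0 - rho))
    return eq_exp, eq_det_fb, eq_det_fifo


for rho in (0.5, 0.9, 0.99):
    eq_exp, eq_det_fb, eq_det_fifo = mean_queue_lengths(rho)
    # The gap between deterministic FB and the other two widens as rho -> 1.
    print(rho, eq_exp, eq_det_fb, eq_det_fifo)
```

For deterministic service the FB mean exceeds the FIFO mean by the factor $1/(1-\rho)$, which is the price of letting all jobs of a busy period leave together.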



4.4 Conditional mean sojourn time 

Let $V(x)$ be the (conditional) sojourn time of a job with service time $x$ in the stationary M/G/1 FB
queue. By analysing the mean behaviour of the queue, Schrage [37] found the following expression
for $EV(x)$.

Theorem 4.9 For all $x$ such that $\rho(x) < 1$, the conditional sojourn time $V(x)$ satisfies

$$EV(x) = \frac{\lambda E[(B \wedge x)^2]}{2(1-\rho(x))^2} + \frac{x}{1-\rho(x)}. \tag{4}$$

A formula for $EV$ can be derived by integrating this expression over all $x$ w.r.t. the service-time
distribution, see also (2). By differentiating (4), Kleinrock [18] obtained the following consequence
of Theorem 4.9 under the condition $EB^2 < \infty$. This condition was removed in [24] by using that
$E[(B \wedge x)^2] = o(x)$ and $1 - F(x) = o(1/x)$ if $EB < \infty$.

Corollary 4.10 The mean conditional sojourn time $EV(x)$ satisfies

$$\lim_{x\to\infty} \frac{d}{dx} EV(x) = \frac{1}{1-\rho}. \tag{5}$$

This result may be interpreted as follows: during the sojourn time of an exceptionally large job,
the service rate it gets is the total service rate (namely 1) reduced by the load of jobs that pass
through the system in the meantime ($\rho$, in the limit).
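To see the conditional mean sojourn time at work, the sketch below evaluates the expression $EV(x) = \lambda E[(B \wedge x)^2] / (2(1-\rho(x))^2) + x/(1-\rho(x))$ for exponential service times with rate $\mu$, an assumption made purely for this example. It uses $E(B \wedge x) = (1 - e^{-\mu x})/\mu$ and $E[(B \wedge x)^2] = 2(1 - e^{-\mu x}(1 + \mu x))/\mu^2$. The function names and the midpoint integration are illustrative choices, not part of the surveyed results.

```python
import math


def truncated_moments_exp(mu, x):
    """First and second moment of min(B, x) for B exponential with rate mu."""
    m1 = (1.0 - math.exp(-mu * x)) / mu
    m2 = 2.0 * (1.0 - math.exp(-mu * x) * (1.0 + mu * x)) / mu ** 2
    return m1, m2


def mean_sojourn_fb_exp(lam, mu, x):
    """EV(x) in the M/M/1 FB queue, from the conditional mean formula."""
    m1, m2 = truncated_moments_exp(mu, x)
    rho_x = lam * m1                      # load of the queue truncated at x
    if rho_x >= 1.0:
        raise ValueError("formula requires rho(x) < 1")
    return lam * m2 / (2.0 * (1.0 - rho_x) ** 2) + x / (1.0 - rho_x)


def unconditional_mean(lam, mu, upper=60.0, steps=60000):
    """Midpoint-rule approximation of EV = integral of EV(x) dF(x)."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += mean_sojourn_fb_exp(lam, mu, x) * mu * math.exp(-mu * x)
    return total * h


lam, mu = 0.5, 1.0
# Small jobs: EV(x)/x is close to 1.
print(mean_sojourn_fb_exp(lam, mu, 1e-3) / 1e-3)
# Large jobs: the slope of EV(x) approaches 1/(1 - rho) = 2.
print(mean_sojourn_fb_exp(lam, mu, 31.0) - mean_sojourn_fb_exp(lam, mu, 30.0))
# Averaging over the job size recovers the M/M/1 mean sojourn time 1/(mu - lam).
print(unconditional_mean(lam, mu))
```

The last value agrees with the fact, used in the text, that for exponential service times the queue-length distribution does not depend on the discipline.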

Combining Theorem 1 and Lemma 1 in Rai et al. [31] yields the following result.

Theorem 4.11 For all x, 

and equality holds if and only if P(B = x) = 1. 

Furthermore, Kleinrock [18] gives an expression for the Laplace transform of $V(x)$:

$$E\exp(-sV(x)) = \exp\big(-x(s + \lambda - \lambda g_x(s))\big)\, E\exp\big(-(s + \lambda - \lambda g_x(s))W(x)\big), \tag{6}$$

where $W(x)$ is the stationary workload in the queue with generic service time $B \wedge x$, and $g_x$ is
the Laplace transform of the busy period in such a queue. By differentiation of these Laplace
transforms, in [21] the following asymptotics for $x \to \infty$ are derived:

$$E[V(x)^n] = \frac{x^n}{(1-\rho(x))^n} + \begin{cases}
O(x^{n-1}) & \text{if } \exists\, \alpha \ge 2 : EB^\alpha < \infty, \\
o(x^{n+1-\alpha}) & \text{if } \exists\, 1 < \alpha < 2 : EB^\alpha < \infty.
\end{cases} \tag{7}$$
Since trivially $V(x) \ge x$, using (7) yields the following new result.

Theorem 4.12 For any $n \in \mathbb{N}$, we have $EV^n < \infty$ if and only if $EB^n < \infty$.

This theorem indicates that the tail behaviour of the sojourn time and the service time is in some
sense similar. This observation is further illustrated in Theorem 6.1 below.

4.5 Overload 

Assume $\rho > 1$. Then the workload in a queue grows a.s. without bound. Under some service
disciplines, e.g., FIFO, still all jobs will eventually leave the system. In the FB queue, only jobs
with service time below a critical value a.s. leave the system. Indeed, jobs smaller than $x^*$, with

$$x^* = \sup\{x : \rho(x) = \lambda E(B \wedge x) < 1\},$$

experience a system for which the stability condition $\rho(x) < 1$ holds, and hence they leave the
queue a.s. Jobs with service time larger than the critical value have a positive probability of being
in the queue forever.
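The critical size $x^*$ is easy to compute numerically once $E(B \wedge x)$ is available. The following sketch uses bisection on the truncated load; the function name `critical_size`, its parameters and the search bound `hi` are assumptions for illustration. The usage example takes exponential service times with rate 1, for which $E(B \wedge x) = 1 - e^{-x}$, so that $x^* = -\log(1 - 1/\lambda)$ in closed form.

```python
import math


def critical_size(lam, truncated_mean, hi=1e6, tol=1e-12):
    """Approximate x* = sup{x : lam * E[min(B, x)] < 1} by bisection.

    `truncated_mean(x)` must return E[min(B, x)] and be nondecreasing in x.
    Returns math.inf when the truncated load stays below 1 up to `hi`.
    """
    if lam * truncated_mean(hi) < 1.0:
        return math.inf               # no critical size found below `hi`
    lo = 0.0
    while hi - lo > tol * max(1.0, hi):
        mid = 0.5 * (lo + hi)
        if lam * truncated_mean(mid) < 1.0:
            lo = mid                  # jobs of size mid still see a stable queue
        else:
            hi = mid
    return lo


# Exponential service with rate 1 and arrival rate 2, so the load is 2 > 1.
x_star = critical_size(2.0, lambda x: 1.0 - math.exp(-x))
print(x_star, math.log(2.0))          # closed form: -log(1 - 1/2) = log 2
```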

For the PS queue, an expression for the asymptotic growth rate is given by Jean-Marie and
Robert ^7]. It is interesting to compare these two growth rates. Numerical calculations indicate
that for some heavy-tailed distributions with much mass to the left of $\lambda^{-1}$, the asymptotic growth
rate under FB is smaller than under PS. However, no theoretical results have been obtained so far.

5 The queue length 

In this section we describe the remaining results for the queue length. In Section 5.1 the maximal
queue length is treated; in Section 5.2 we give transforms of the stationary queue length, and results
for the cohort process.

5.1 The maximal queue length 

Now we consider the maximal queue length in a busy period, $M$. The distribution of $M$ in the
M/D/1 queue essentially already appears in Borel [3]:

Theorem 5.1 The distribution of the maximal length $M$ in the M/D/1 FB busy period with arrival
rate $\lambda$ and service times equal to 1 satisfies

$$P(M = n) = \lambda^{n-1} e^{-\lambda n} \frac{n^{n-1}}{n!} \sim \frac{e^{n(\log\lambda + 1 - \lambda)}}{n\sqrt{n}\,\lambda\sqrt{2\pi}}, \qquad n \to \infty.$$

Here $f(n) \sim g(n)$ means that $\lim_{n\to\infty} f(n)/g(n) = 1$.

Nuyens proved that if the service times have a log-convex density, then the tail of M is 
bounded by an exponential tail: 
Theorem 5.2 If the service-time distribution has a log-convex density, then

$$P(M > n) \le \rho^n, \qquad n = 0, 1, \ldots.$$

Interestingly, given that the service-time distribution has a log-convex density, the upper bound
$\rho^n$ for $P(M > n)$ is insensitive to the precise form of the service-time distribution. Furthermore,
one may wonder whether Theorem 5.2 implies that all moments of $Q$ are finite for log-convex
densities, see also Theorem 4.2.

By the regenerative structure of the queue length process, the maximal queue length during a
busy period is related to the maximal queue length over the time interval $[0,t]$ for $t \to \infty$, see the
survey article Asmussen j^j. Nuyens [25] used this idea to show the following.

Theorem 5.3 Let $M(t)$ be the maximal queue length over the time interval $[0,t]$ of an M/G/1 FB
queue with i.i.d. service times with a log-convex density. Assume that $\rho < 1$. Then for any $x > 0$,
the inequality

$$P(M(t) > a \log t + b + x) \le \rho^x$$

holds for $t$ large enough, where $a = -1/\log\rho$ and $b = -(\log\lambda + \log(1-\rho))/\log\rho + 1$.

Using Theorem 5.3, calculations in j^S] showed that for some heavy-tailed log-convex densities, the
time to overflow of a buffer in the FB queue is of a different order of magnitude than in the FIFO 
queue. This illustrates the idea that in case of heavy-tailed service times, using the FB discipline 
instead of FIFO may increase the performance of the queue considerably. 
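The constants in Theorem 5.3 are simple to evaluate. The sketch below computes the level $a \log t + b + x$ and the corresponding bound $\rho^x$ for given $\lambda$, $\rho$, $t$ and $x$. The function name `overflow_level_and_bound` is an assumption, and the sketch does not check the hypotheses of the theorem (a log-convex service-time density and $t$ large enough); it only evaluates the formulas.

```python
import math


def overflow_level_and_bound(lam, rho, t, x):
    """Return (a*log(t) + b + x, rho**x) with a and b as in Theorem 5.3.

    Illustrative helper: the bound rho**x on P(M(t) > level) is only
    claimed by the theorem for t large enough and log-convex densities.
    """
    if not 0.0 < rho < 1.0:
        raise ValueError("rho must lie strictly between 0 and 1")
    a = -1.0 / math.log(rho)
    b = -(math.log(lam) + math.log(1.0 - rho)) / math.log(rho) + 1.0
    return a * math.log(t) + b + x, rho ** x


# With lam = 1 and rho = 1/2 the constant b vanishes and a = 1/log 2.
level, bound = overflow_level_and_bound(1.0, 0.5, 1e6, 3.0)
print(level, bound)
```

The level grows only logarithmically in $t$, which is what makes the time to overflow of a buffer under FB so large in these cases.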

5.2 The stationary queue length 

Pechinkin obtained the following expression for the generating function of $Q$.

Theorem 5.4 Let $Q$ be the number of jobs in the stationary M/G/1 queue. Then for $z < 1$,

(8)

where $v(t,z)$ is the unique non-negative root of the equation

(9)

Yashkov obtained the counterpart of (8) for the case of batch arrivals. From the proof of
Theorem 5.4 it follows that $v(t,1) = 0$ and that $v$ is differentiable w.r.t. $z$. This allows for computing
the moments of $Q$ by differentiating (8).

Let $Q_x$ be the number of jobs younger than $x$ in the stationary queue. Schassberger ^S] obtained
the generating functional of the point process $Q_x$.
Theorem 5.5 For all functions $h : [0,\infty) \to [0,\infty)$,

£(exp(- [h(x)dQ x )) = (l-p)exp(- fz^M {W) 

J Jo OZ z=exp(-h(t)) 

where $v(t,z)$ is again the unique non-negative root of (9).

The original proof of (10) in Schassberger [SHI uses a discrete approximation and is quite
technical. Later, Robert and Schassberger [35] found a more direct way to prove (10), using (8) in
combination with the following result.

Theorem 5.6 The process $Q_x$ has independent increments.

The process $Q_x$ is integer-valued, but it has a jump larger than 1 at $x$ when the cohort of age $x$
contains more than one job. Hence, it is a non-homogeneous Poisson process with batch arrivals,
see Daley and Vere-Jones [10]. Let $K_x$ be the number of cohorts in the stationary M/G/1 FB queue
consisting of jobs younger than $x$. The cohort process $K_x$ has only jumps of size 1. Combining this
observation with Theorem 5.6 leads to the following new corollary.

Corollary 5.7 The cohort process $K_x$ is a non-homogeneous Poisson process with intensity $\nu(x)$
given by

$$\nu(x) = -\frac{d}{dx} \log P(K_x = 0) = -\frac{d}{dx} \log P(Q_x = 0) = -\frac{d}{dx} \log(1 - \rho(x)) = \frac{\lambda(1 - F(x))}{1 - \rho(x)}.$$

6 The sojourn time 

In this section we describe the results obtained for the tail behaviour of the sojourn-time
distribution. In Section 6.1 we discuss the case of heavy-tailed sojourn times, and in Section 6.2 we
consider light-tailed service times.

6.1 Asymptotics: tail equivalence for heavy tails 

For a broad class of heavy-tailed service times, Núñez Queija [23] obtained asymptotics for the tail
of the sojourn-time distribution. Recently, Nuyens et al. j^H] generalised this result to the GI/GI/1
setting. A key role is played by the following class of functions. A function $h$ is said to be of
intermediate regular variation at infinity, if

$$\liminf_{\varepsilon \downarrow 0}\, \liminf_{x\to\infty} \frac{h((1+\varepsilon)x)}{h(x)} = 1.$$

Theorem 6.1 Suppose that $1 - F$ is of intermediate regular variation at infinity. If $EB^\alpha < \infty$ for
some $\alpha > 1$, then

$$P(V > x) \sim P(B > (1-\rho)x). \tag{11}$$
Theorem 6.1 makes sense intuitively: since jobs under FB are only served if no younger jobs are
present, very old jobs most likely only get service if no other jobs are present. Hence, long jobs are
served as if they were alone in a system with service rate $1 - \rho$. This phenomenon is often called
the reduced load approximation. See also the remark after equation (5). Furthermore, Theorem 6.1
supports the remark below Theorem 4.12.

Núñez Queija [23] showed that Theorem 6.1 holds for M/G/1 PS and SRPT as well. Guillemin
et al. ^1] showed that relation (11) holds for a wide class of processor sharing queues. For an
overview of the tail behaviour of $V$ under several other disciplines in case of regularly varying service
times, see Borst et al. [§].

It is an interesting open problem whether there exists a service discipline $\pi$ and some constant
$1 - \rho < a < 1$ such that in the M/G/1 queue with heavy-tailed service times,

$$P(V_\pi > x) \sim P(B > ax). \tag{12}$$

If such a discipline does not exist, then the FB queue has the perhaps counterintuitive quality that
although large jobs are discriminated against, large sojourn times are as unlikely as possible. Another
interesting question is how (12) relates to the results in Harchol-Balter et al. JH], who study the behaviour
of $V(x)/x$ for large $x$. These results are discussed in Section 7 below.

Finally, let us compare the tail behaviour of the sojourn time under FB and the busy period in
the M/G/1 queue. Let $\eta$ be a regularly varying function at $\infty$ of index $\alpha < -1$. Let $L$ denote the
busy period length. De Meyer and Teugels proved that, as $x \to \infty$,

$$P(B > x) \sim \eta(x) \implies P(L > x) \sim (1-\rho)^{\alpha-1}\eta(x). \tag{13}$$

Since $\eta(x)$ is also of intermediate regular variation at $\infty$, we have by Theorem 6.1 and (13),

$$P(V > x) \sim P(B > (1-\rho)x) \sim (1-\rho)^{\alpha}\eta(x) \sim (1-\rho)P(L > x).$$

Hence, in this case the tails of V and L are asymptotically equal up to a multiplicative constant. 
This property is shared with PS and SRPT, but it does not hold for FIFO. 
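For a concrete feel for the reduced load approximation, the sketch below evaluates both asymptotic expressions for Pareto service times with $P(B > x) = (k/x)^s$ for $x \ge k$ and shape $s > 1$, i.e., a regularly varying tail of index $-s$. The approximate sojourn-time tail is $P(B > (1-\rho)x)$ and the approximate busy-period tail is $(1-\rho)^{-s-1} P(B > x)$. The function names and the Pareto parametrisation are assumptions made for this example, and the returned values are asymptotic approximations, not exact probabilities.

```python
def pareto_tail(k, shape, x):
    """P(B > x) for a Pareto distribution with scale k and shape `shape`."""
    return 1.0 if x <= k else (k / x) ** shape


def reduced_load_tails(lam, k, shape, x):
    """Asymptotic approximations of P(V > x) and P(L > x) for large x.

    Hypothetical helper: returns (rho, sojourn-time tail, busy-period tail),
    using the reduced load approximation and the busy-period asymptotics
    for regularly varying service times of index -shape.
    """
    if shape <= 1.0:
        raise ValueError("shape must exceed 1 for a finite mean")
    rho = lam * k * shape / (shape - 1.0)      # load: lam * E[B]
    if rho >= 1.0:
        raise ValueError("the queue must be stable (rho < 1)")
    sojourn = pareto_tail(k, shape, (1.0 - rho) * x)
    busy = (1.0 - rho) ** (-shape - 1.0) * pareto_tail(k, shape, x)
    return rho, sojourn, busy


rho, sojourn, busy = reduced_load_tails(lam=0.2, k=1.0, shape=1.5, x=1000.0)
# The two approximations differ by the constant factor 1 - rho.
print(rho, sojourn, busy, sojourn / busy)
```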

6.2 Asymptotics: light-tailed service times 

For light-tailed distributions, the behaviour of the sojourn-time distribution is formulated in terms 
of the following quantity. 

Definition 6.2 (Decay rate) The (asymptotic) decay rate of a random variable $X$ is defined as 

$$\gamma(X) = -\lim_{x\to\infty} \frac{1}{x} \log P(X > x),$$

provided the limit exists. 
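
To make the definition concrete, the following sketch evaluates the finite-$x$ quantity $-\log P(X > x)/x$ for three example tails. The three distributions and their parameters are assumptions chosen for illustration; the log-tails are coded directly to avoid floating-point underflow at large $x$.

```python
import math


def decay_rate_proxy(log_tail, x):
    """Finite-x proxy for the decay rate: -log P(X > x) / x."""
    return -log_tail(x) / x


def log_tail_exponential(x, mu=2.0):
    # P(X > x) = exp(-mu x)
    return -mu * x


def log_tail_erlang2(x, mu=2.0):
    # Erlang-2 with rate mu: P(X > x) = (1 + mu x) exp(-mu x)
    return math.log1p(mu * x) - mu * x


def log_tail_pareto(x, alpha=2.5):
    # Heavy-tailed example: P(X > x) = (1 + x)**(-alpha)
    return -alpha * math.log1p(x)


if __name__ == "__main__":
    for x in (1e1, 1e3, 1e6):
        print(x,
              decay_rate_proxy(log_tail_exponential, x),
              decay_rate_proxy(log_tail_erlang2, x),
              decay_rate_proxy(log_tail_pareto, x))
```

In this example the exponential and Erlang-2 proxies approach the common rate `mu`, while the Pareto proxy tends to 0, reflecting that a heavy-tailed distribution has decay rate zero.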

Mandjes and Nuyens [19] studied the decay rate of the sojourn time in the M/G/1 queue. Nuyens 
et al. obtained the following generalisation. 
Theorem 6.3 Consider a GI/GI/1 FB queue. Let $L$ be the length of a busy period. Assume that 
$E(\exp(\kappa B)) < \infty$ for some $\kappa > 0$. Then $\gamma(V)$ exists and 



where $\Phi_B$ and $\Phi_A$ are the generating functions of $B$ and the generic interarrival time $A$, and $\Phi_A^{\leftarrow}$ 
is the inverse of $\Phi_A$. 

From the proof of Theorem 4.1 in Nuyens and Zwart [27], it follows that under any work-conserving 
discipline, i.e. a discipline that serves at rate 1 as long as there is work in the queue, 
$\gamma(V) \ge \gamma(L)$ if the service times have an exponential moment. Hence, for service times with an 
exponential moment, the FB discipline minimises the decay rate of the sojourn time in the class of 
work-conserving disciplines. 

Contrasting Theorems 3.1 and 6.3, we observe the following interesting phenomenon: for 
gamma-distributed service times with a log-convex density, see Appendix A, the FB discipline 
minimises the queue length, but the sojourn time has the smallest possible decay rate, that is, the 
heaviest possible tail. Hence, optimising one characteristic of a queue may have an adverse effect 
on another characteristic. 

We briefly discuss the decay rate of $V$ under some other service disciplines. Mandjes and Zwart 
[20] considered the GI/GI/1 PS queue with light-tailed service requests. They show that $\gamma(V_{PS})$ 
is equal to $\gamma(L)$, under an additional condition that rules out distributions with bounded support 
or extremely light tails. Nuyens and Zwart [27] show that $\gamma(V_{SRPT})$ equals $\gamma(L)$ in the GI/GI/1 
SRPT queue, unless there is mass in the endpoint $x_F$ of the service-time distribution. Finally, it 
may be seen that the decay rate $\gamma(V_{FIFO})$ of the sojourn time in the FIFO system is strictly larger 
than $\gamma(L)$, see also [20, 27]. 
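
The gap between the FIFO decay rate and the lower bound $\gamma(L)$ can be made explicit in the M/M/1 special case, which is not treated separately in the text. The sketch below assumes the classical closed forms for that case: the FIFO sojourn time is exponential with rate $\mu - \lambda$, and the busy-period tail decays at rate $(\sqrt{\mu} - \sqrt{\lambda})^2$. The function names are hypothetical.

```python
import math


def busy_period_decay_rate_mm1(lam, mu):
    """Decay rate of the M/M/1 busy period: (sqrt(mu) - sqrt(lam))**2."""
    return (math.sqrt(mu) - math.sqrt(lam)) ** 2


def fifo_sojourn_decay_rate_mm1(lam, mu):
    """Decay rate of the M/M/1 FIFO sojourn time, which is Exp(mu - lam)."""
    return mu - lam


if __name__ == "__main__":
    mu = 1.0
    for lam in (0.1, 0.5, 0.9):
        g_l = busy_period_decay_rate_mm1(lam, mu)
        g_fifo = fifo_sojourn_decay_rate_mm1(lam, mu)
        print(f"rho={lam / mu:.1f}  gamma(L)={g_l:.4f}  gamma(V_FIFO)={g_fifo:.4f}")
```

Since $\mu - \lambda = (\sqrt{\mu} - \sqrt{\lambda})(\sqrt{\mu} + \sqrt{\lambda})$, the FIFO decay rate exceeds $\gamma(L)$ for every stable load in this example.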

For the conditional sojourn time, Nuyens et al. [26] showed the following result. 

Theorem 6.4 For all $x$, $\gamma(V(x)) = \gamma(L_x)$, where $L_x$ is the busy-period length in the queue with 
generic service time $B \wedge x$. 

7 Slowdown 

We conclude the paper by considering another performance measure for queueing policies, the 
so-called slowdown. The slowdown is a way to measure how fairly jobs are treated by a service 
discipline, and is defined as follows. 

Definition 7.1 (Slowdown) The slowdown $S(x)$ of a job of size $x$ is defined by $S(x) = V(x)/x$. 
The slowdown $S$ is defined as $S = S(B)$, where $B$ is the generic service time, independent of $S(x)$, 
and we may write $P(S > x) = \int P(S(u) > x)\,dF(u)$. 

Lemma 14 in Nuyens et al. [26] implies the following result for the asymptotic behaviour of the 
slowdown. 
Theorem 7.2 In the FB queue, if $\rho < 1$, then $S(x)$ converges a.s. to $1/(1 - \rho)$ as $x \to \infty$. 

Harchol-Balter et al. [15] proved that in the M/G/1 queue, $\lim_{x\to\infty} S(x) \le 1/(1 - \rho)$ a.s. Hence, 
the asymptotic slowdown under FB is maximal, just as it is under PS. 

The intuitively appealing idea 'the larger the service request of a job, the larger its slowdown 
under FB' was recently invalidated by Harchol-Balter and Wierman [16]. 

Theorem 7.3 The mean slowdown $ES(x)$ is not monotonically increasing in $x$. In fact, $ES(x)$ 
converges from above to $1/(1 - \rho)$ as $x \to \infty$. 
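
The non-monotonicity can be observed numerically in a simple special case. The sketch below assumes an M/M/1 FB queue with mean service time 1 and arrival rate $\rho$, together with the classical expression for the mean conditional sojourn time under FB, $EV(x) = x/(1-\rho_x) + \lambda E(B \wedge x)^2 / (2(1-\rho_x)^2)$ with $\rho_x = \lambda E(B \wedge x)$. That formula is not restated in this section, and the function name is hypothetical.

```python
import math


def mean_slowdown_fb_mm1(x, rho):
    """E S(x) under FB for exponential(1) service times and arrival rate rho (assumed M/M/1)."""
    # Load generated by jobs truncated at x: lambda * E[min(B, x)].
    rho_x = rho * (1.0 - math.exp(-x))
    # Second moment of the truncated service time: E[min(B, x)^2].
    m2_x = 2.0 * (1.0 - math.exp(-x) - x * math.exp(-x))
    mean_sojourn = x / (1.0 - rho_x) + rho * m2_x / (2.0 * (1.0 - rho_x) ** 2)
    return mean_sojourn / x


if __name__ == "__main__":
    rho = 0.5
    print("limit 1/(1 - rho) =", 1.0 / (1.0 - rho))
    for x in (0.01, 0.5, 1.0, 2.0, 5.0, 10.0, 100.0, 1000.0):
        print(f"x={x:8.2f}  E S(x)={mean_slowdown_fb_mm1(x, rho):.4f}")
```

In this example $ES(x)$ starts near 1 for very small jobs, rises above $1/(1-\rho)$ for jobs of moderate size, and then decreases towards $1/(1-\rho)$, in line with the theorem.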

An important conclusion is that, when the slowdown is used as a measure of fairness, it is not the 
longest jobs that are treated most unfairly, as is often believed, but certain 'medium long' jobs. 

Some studies have compared the slowdown in the FB queue with the slowdown in 
other queues. Let $S_{FB}$ and $S_{PS}$ denote the slowdown in two M/G/1 queues with the same arrival 
rate and service-time distribution. Theorem 2 of Rai et al. [HJ reads 

$$ES_{FB} \le \frac{2-\rho}{2(1-\rho)}\,ES_{PS} = \frac{2-\rho}{2(1-\rho)^2}.$$
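
One way to see where a constant of this form can come from is the following sketch. It is not necessarily the argument of Rai et al., and it assumes two classical M/G/1 formulas that are not restated in this section: $EV_{FB}(x) = x/(1-\rho_x) + \lambda E(B \wedge x)^2/(2(1-\rho_x)^2)$ with $\rho_x = \lambda E(B \wedge x)$, and $EV_{PS}(x) = x/(1-\rho)$, so that $ES_{PS} = 1/(1-\rho)$.

```latex
\begin{align*}
ES_{FB}(x)
  &= \frac{1}{1-\rho_x} + \frac{\lambda E(B\wedge x)^2}{2x(1-\rho_x)^2} \\
  &\le \frac{1}{1-\rho_x} + \frac{\rho_x}{2(1-\rho_x)^2}
     && \text{since } (B\wedge x)^2 \le x\,(B\wedge x) \\
  &= \frac{2-\rho_x}{2(1-\rho_x)^2}
   \le \frac{2-\rho}{2(1-\rho)^2}
     && \text{since } \rho_x \le \rho \text{ and } r \mapsto \frac{2-r}{2(1-r)^2} \text{ is increasing on } [0,1).
\end{align*}
```

Integrating this bound, which is uniform in $x$, against $dF(x)$ gives $ES_{FB} \le (2-\rho)/(2(1-\rho)^2) = \frac{2-\rho}{2(1-\rho)}\,ES_{PS}$ under the stated assumptions.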

Feng and Misra [13] show that for DFR service-time distributions, the FB discipline minimises the 
expected slowdown over the class $\mathcal{D}$. For more results on the slowdown and (un)fairness we refer 
to Rai et al. [31] and Bansal and Wierman [1]. 

Rai et al. [31] compare, through numerical evaluation, the slowdown under FB, PS and SRPT 
for service times with the so-called high-variability property. For such service-time distributions, 
less than 1% of the jobs account for more than half the load. According to recent studies, internet 
traffic exhibits this property, see Crovella and Bestavros [9]. The numerical study in [31] shows 
that for service times with the high-variability property, the slowdown under FB is quite close to that under SRPT. Furthermore, 
a very large percentage of the jobs have a significantly smaller slowdown under FB than under PS 
or FIFO. 
A Classes of distributions and stochastic orderings 

In this appendix we give an explanation of the classes of distributions and stochastic order relations 
that are used in this paper. 

A distribution $F$ with density $f$ belongs to the class DFR (decreasing failure rate) if its failure 
(or hazard) rate, $f(x)/(1 - F(x))$, is decreasing for $x > 0$. The class IFR (increasing failure rate) 
is defined analogously. Alternatively, one may define the failure rate only for $x$ in the support of 
$F$. However, as remarked by Down and Wu [12], this alternative definition is not good enough to 
prove the optimality of FB in Theorem 3.1, and hence it will not be used here. 

A density $f$ is called log-convex if $\log f$ is convex. By integration, one can show that the class 
of log-convex densities is a subclass of DFR. The class of distributions with a log-convex density 
includes many well-known distributions, for example Pareto distributions, gamma distributions with 
density $f(x) = \lambda^n x^{n-1} \exp(-\lambda x)/\Gamma(n)$, $\lambda > 0$, $0 < n \le 1$, $x > 0$, and Weibull distributions with distribution 
function $F(x) = 1 - \exp(-a x^\beta)$, $\beta < 1$, $x > 0$. Densities are called log-concave if $\log f$ is concave. 
For more results on these classes of distributions we refer to Shaked and Shanthikumar [38]. 
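
These two properties can be checked numerically for the Weibull example. The sketch below uses hypothetical parameter values ($a = 1$, $\beta = 0.5$ and, for contrast, $\beta = 2$) and a finite grid, so it illustrates the definitions rather than proving anything.

```python
import math


def weibull_hazard(x, a=1.0, beta=0.5):
    """Failure rate f(x) / (1 - F(x)) for F(x) = 1 - exp(-a x**beta)."""
    return a * beta * x ** (beta - 1.0)


def weibull_log_density(x, a=1.0, beta=0.5):
    """log f(x) for the same Weibull distribution."""
    return math.log(a * beta) + (beta - 1.0) * math.log(x) - a * x ** beta


def is_decreasing(values):
    return all(u > v for u, v in zip(values, values[1:]))


def is_convex_on_grid(values, tol=1e-12):
    """Second differences on an equally spaced grid are non-negative."""
    return all(values[i - 1] - 2.0 * values[i] + values[i + 1] >= -tol
               for i in range(1, len(values) - 1))


if __name__ == "__main__":
    grid = [0.1 * k for k in range(1, 51)]  # equally spaced points in (0, 5]
    for beta in (0.5, 2.0):
        hazards = [weibull_hazard(x, beta=beta) for x in grid]
        logs = [weibull_log_density(x, beta=beta) for x in grid]
        print(f"beta={beta}: DFR on grid={is_decreasing(hazards)}, "
              f"log-convex on grid={is_convex_on_grid(logs)}")
```

With $\beta = 0.5$ both checks succeed on the grid, whereas with $\beta = 2$ the hazard rate is increasing and the log-density is concave.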

Recall that a random variable $X$ is stochastically smaller than $Y$, notation $X \le_{st} Y$, if 
$P(X \le x) \ge P(Y \le x)$ for all $x$. As a consequence, we can find $X' \stackrel{d}{=} X$ and $Y' \stackrel{d}{=} Y$ such that 
$P(X' \le Y') = 1$. In fact, this characterisation is equivalent to the definition of $\le_{st}$. The stochastic ordering 
of processes is a generalisation of stochastic ordering for random variables, and can be defined 
similarly, see also Section 4.B.7 of Shaked and Shanthikumar [38]: we say that two processes 
$\{X(t), t \ge 0\}$ and $\{Y(t), t \ge 0\}$ are stochastically ordered, notation $\{X(t), t \ge 0\} \le_{st} \{Y(t), t \ge 0\}$, 
if there exist processes $\{X'(t), t \ge 0\}$ and $\{Y'(t), t \ge 0\}$, defined on a common probability space, 
such that $P(X'(t) \le Y'(t)\ \forall t) = 1$ and 

$$\{X'(t), t \ge 0\} \stackrel{d}{=} \{X(t), t \ge 0\}, \qquad \{Y'(t), t \ge 0\} \stackrel{d}{=} \{Y(t), t \ge 0\}.$$
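
For random variables, one standard way to build such a coupling is to feed the same uniform variable into both inverse distribution functions. The sketch below does this for two exponential distributions; the rates, the seed and the function name are assumptions chosen for illustration.

```python
import math
import random


def coupled_exponentials(rate_x, rate_y, n, seed=0):
    """Return n pairs (X', Y') with X' ~ Exp(rate_x), Y' ~ Exp(rate_y), built from a common uniform."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        u = rng.random()                       # uniform on [0, 1)
        x = -math.log(1.0 - u) / rate_x        # inverse distribution function of Exp(rate_x)
        y = -math.log(1.0 - u) / rate_y        # same u, inverse distribution function of Exp(rate_y)
        pairs.append((x, y))
    return pairs


if __name__ == "__main__":
    # Exp(2) is stochastically smaller than Exp(1): 1 - exp(-2x) >= 1 - exp(-x).
    pairs = coupled_exponentials(rate_x=2.0, rate_y=1.0, n=10_000)
    print("X' <= Y' in every sample:", all(x <= y for x, y in pairs))
```

Each coordinate has the intended marginal distribution, and the ordering $X' \le Y'$ holds in every sample rather than only in distribution.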